SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Koji YANAI et al. 

(ii) TITLE OF INVENTION: (3-FRUCTOFURANOSIDASE AND ITS GENE, 
METHOD OF ISOLATING p-FRUCTOFURANOSIDASE GENE, SYSTEM 
FOR PRODUCING p-FRUCTOFURANOSIDASE , AND 
3-FRUCTOFURANOSIDASE VARIANT 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Wenderoth, Lind & Ponack, L.L.P. 

(B) STREET: 2033 K Street, N.W. , Suite 800 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20006 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 mb 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: Wordperfect 5.1 
(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: NEW 

(B) FILING DATE: November 23, 2001 
(vii) PRIOR APPLICATION DATA: 
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(A) APPLICATION NUMBER: 09/142,623 

(B) FILING DATE: September 10, 1998 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Lee Cheng 

(B) REGISTRATION NUMBER: 4 0 , 949 

(C) REFERENCE /DOCKET NUMBER: 2001-1611 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-721-8200 

(B) TELEFAX: 202-721-8250 
(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acid residues 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Not relevant 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 

(ix) FEATURE: 

(A) NAME/KEY: mat peptide 

(B) LOCATION: 1..635 

(C) IDENTIFICATION METHOD: E 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
Ser Tyr His Leu Asp Thr Thr Ala Pro Pro Pro Thr Asn Leu Ser Thr 

1 5 10 15 

Leu Pro Asn Asn Thr Leu Phe His Val Trp Arg Pro Arg Ala His He 

20 25 30 

Leu Pro Ala Glu Gly Gin He Gly Asp Pro Cys Ala His Tyr Thr Asp 

35 40 45 

Pro Ser Thr Gly Leu Phe His Val Gly Phe Leu His Asp Gly Asp Gly 
50 55 60 

; He Ala Gly Ala Thr Thr Ala Asn Leu Ala Thr Tyr Thr Asp Thr Ser 
■ 65 70 75 80 

' Asp Asn Gly Ser Phe Leu He Gin Pro Gly Gly Lys Asn Asp Pro Val 

85 90 95 

Ala Val Phe Asp Gly Ala Val He Pro Val Gly Val Asn Asn Thr Pro 

100 105 HO 

Thr Leu Leu Tyr Thr Ser Val Ser Phe Leu Pro He His Trp Ser He 

115 120 125 

Pro Tyr Thr Arg Gly Ser Glu Thr Gin Ser Leu Ala Val Ala Arg Asp 

130 135 140 

Gly Gly Arg Arg Phe Asp Lys Leu Asp Gin Gly Pro Val He Ala Asp 
145 150 155 160 

His Pro Phe Ala Val Asp Val Thr Ala Phe Arg Asp Pro Phe Val Phe 

165 170 175 

Arg Ser Ala Lys Leu Asp Val Leu Leu Ser Leu Asp Glu Glu Val Ala 
180 185 190 



Arg Asn Glu Thr Ala Val Gin Gin Ala Val Asp Gly Trp Thr Glu Lys 

195 200 205 

Asn Ala Pro Trp Tyr Val Ala Val Ser Gly Gly Val His Gly Val Gly 

210 215 220 

Pro Ala Gin Phe Leu Tyr Arg Gin Asn Gly Gly Asn Ala Ser Glu Phe 
225 230 235 240 

Gin Tyr Trp Glu Tyr Leu Gly Glu Trp Trp Gin Glu Ala Thr Asn Ser 

245 250 255 

Ser Trp Gly Asp Glu Gly Thr Trp Ala Gly Arg Trp Gly Phe Asn Phe 

260 265 270 

Glu Thr Gly Asn Val Leu Phe Leu Thr Glu Glu Gly His Asp Pro Gin 

275 280 285 

Thr Gly Glu Val Phe Val Thr Leu Gly Thr Glu Gly Ser Gly Leu Pro 

290 295 300 

lie Val Pro Gin Val Ser Ser He His Asp Met Leu Trp Ala Ala Gly 
305 310 315 320 

Glu Val Gly Val Gly Ser Glu Gin Glu Gly Ala Lys Val Glu Phe Ser 

325 330 335 

Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser Ala Tyr Ala Ala 

340 345 350 

Ala Gly Lys Val Leu Pro Ala Ser Ser Ala Val Ser Lys Thr Ser Gly 

355 360 365 

Val Glu Val Asp Arg Tyr Val Ser Phe Val Trp Leu Thr Gly Asp Gin 
370 375 380 
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Tyr Glu Gin Ala 
385 

Ser Leu Leu Leu 

Val Asp Asn Glu 
420 

Glu Ser Asp Asn 
435 

Ala Arg Glu Thr 
450 

Glu Glu Asp Arg 
465 

Ser Pro Ser Ser 

Ala Ser Ala Arg 
500 

Ser Glu Leu Glu 
515 

Leu Val Val Asp 
530 

Gly Leu Asp Ser 
545 

lie Glu Asn Gly 



Asp Gly Phe Pro 
390 

Pro Arg Glu Leu 
405 

Leu Val Arg Glu 

Gin Thr Ala Arg 
440 

Lys Ala Ala Leu 
455 

Thr Leu Gin Thr 
470 

Lys Phe Phe Val 
485 

Ser Ser Pro Leu 

Arg Thr Ala lie 
520 

Arg Ser Gin Thr 
535 

Phe Thr Glu Ser 
550 

Gin Glu Gin Val 
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Thr Ala Gin Gin 

395 

Lys Val Gin Thr 
410 

Glu Gly Val Ser 
425 

Leu Arg Thr Leu 

Leu Ala Asn Gly 
460 

Ala Ala Val Val 
475 

Leu Thr Ala Gin 
490 

Gin Ser Gly Phe 
505 

Tyr Tyr Gin Phe 

Ser Ala Ala Ala 
540 

Gly Lys Leu Arg 
555 

Glu Thr Leu Asp 
570 



Gly Trp Thr Gly 
400 

Val Glu Asn Val 
415 

Trp Val Val Gly 
430 

Gly lie Thr lie 
445 

Ser Val Thr Ala 

Pro Phe Ala Gin 
480 

Leu Glu Phe Pro 
495 

Glu lie Leu Ala 
510 

Ser Asn Glu Ser 
525 

Pro Thr Asn Pro 

Leu Phe Asp Val 
560 

Leu Thr Val Val 
575 
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Val Asp Asn Ala Val Val 

580 

Ser Thr Trp Ala Arg Ser 

595 

Phe His Asn Gly Glu Gly 
610 

Glu Gly Leu Tyr Asn Ala 
625 630 



Glu Val Tyr Ala Asn 

585 

Trp Tyr Asp Asn Ser 
600 

Glu Val Gin Phe Arg 
615 

Trp Pro Glu Arg Asn 

635 



Gly Arg Phe Ala Leu 

590 

Thr Gin lie Arg Phe 
605 

Asn Val Ser Val Ser 
620 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1905 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double stranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 

(ix) FEATURE: 

(A) NAME /KEY : mat peptide 

(B) LOCATION: 1 .. 1905 

(C) IDENTIFICATION METHOD : E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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TCATACCACC TGGACACCAC GGCCCCGCCG CCGACCAACC TCAGCACCCT CCCCAACAAC 60 
ACCCTCTTCC ACGTGTGGCG GCCGCGCGCG CACATCCTGC CCG CCGAGGG CCAGATCGGC 120 
GACCCCTGCG CGCACTACAC CGACCCATCC ACCGGCCTCT TCCACGTGGG GTTCCTGCAC 180 
GACGGGGACG GCATCGCGGG CGCCACCACG GCCAACCTGG CCACCTACAC CGATACCTCC 240 
GATAACGGGA GCTTCCTGAT CCAGCCGGGC GGGAAGAACG ACCCCGTCGC CGTGTTCGAC 3 00 
GGCGCCGTCA TCCCCGTCGG CGTCAACAAC ACCCCCACCT TACTCTACAC CTCCGTCTCC 360 
TTCCTGCCCA TCCACTGGTC CATCCCCTAC ACCCGCGGCA GCGAGACGCA GTCGTTGGCC 420 
GTCGCGCGCG ACGGCGGCCG CCGCTTCGAC AAGCTCGACC AGGGCCCCGT CATCGCCGAC 480 
CACCCCTTCG CCGTCGACGT CACCGCCTTC CGCGATCCGT TTGTCTTCCG CAGTGCCAAG 54 0 
TTGGATGTGC TGCTGTCGTT GGATGAGGAG GTGGCGCGGA ATGAGACGGC CGTGCAGCAG 600 
GCCGTCGATG GCTGGACCGA GAAGAACGCC CCCTGGTATG TCGCGGTCTC TGGCGGGGTG 660 
CACGGCGTCG GGCCCGCGCA GTTCCTCTAC CGCCAGAACG GCGGGAACGC TTCCGAGTTC 720 
CAGTACTGGG AGTACCTCGG GGAGTGGTGG CAGGAGGCGA CCAACTCCAG CTGGGGCGAC 780 
GAGGGCACCT GGGCCGGGCG CTGGGGGTTC AACTTCGAGA CGGGGAATGT GCTCTTCCTC 840 
ACCGAGGAGG GCCATGACCC CCAGACGGGC GAGGTGTTCG TCACCCTCGG CACGGAGGGG 900 
TCTGGCCTGC CAATCGTGCC GCAGGTCTCC AGTATCCACG ATATGCTGTG GGCGGCGGGT 960 
GAGGTCGGGG TGGGCAGTGA GCAGGAGGGT GCCAAGGTCG AGTTCTCCCC CTCCATGGCC 1020 
GGGTTTCTGG ACTGGGGGTT CAGCGCCTAC GCTGCGGCGG GCAAGGTGCT GCCGGCCAGC 1080 
TCGGCGGTGT CGAAGACCAG CGGCGTGGAG GTGGATCGGT ATGTCTCGTT CGTCTGGTTG 1140 
ACGGGCGACC AGTACGAGCA GGCGGACGGG TTCCCCACGG CCCAGCAGGG GTGGACGGGG 1200 
TCGCTGCTGC TGCCGCGCGA GCTGAAGGTG CAGACGGTGG AGAACGTCGT CGACAACGAG 12 60 
CTGGTGCGCG AGGAGGGCGT GTCGTGGGTG GTGGGGGAGT CGGACAACCA GACGGCCAGG 1320 
CTGCGCACGC TGGGGATCAC GATCGCCCGG GAGACCAAGG CGGCCCTGCT GGCCAACGGC 1380 
TCGGTGACCG CGGAGGAGGA CCGCACGCTG CAGACGGCGG CCGTCGTGCC GTTCGCGCAA 1440 
TCGCCGAGCT CCAAGTTCTT CGTGCTGACG GCCCAGCTGG AGTTCCCCGC GAGCGCGCGC 1500 
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TCGTCCCCGC TCCAGTCCGG GTTCGAAATC 
TACTACCAGT TCAGCAACGA GTCGCTGGTC 
CCCACGAACC CCGGGCTGGA TAGCTTTACT 
ATCGAGAACG GCCAGGAGCA GGTCGAGACG 
GTTGTCGAGG TGTATGCCAA CGGGCGCTTT 
GACAACTCCA CCCAGATCCG CTTCTTCCAC 
GTCTCCGTGT CGGAGGGGCT CTATAACGCC 



CTGGCGTCGG AGCTGGAGCG CACGGCCATC 1560 
GTCGACCGCA GCCAGACTAG TGCGGCGGCG 162 0 
GAGTCCGGCA AGTTGCGGTT GTTCGACGTG 1680 
TTGGATCTCA CTGTCGTCGT GGATAACGCG 1740 
GCGTTGAGCA CCTGGGCGAG ATCGTGGTAC 1800 
AACGGCGAGG GCGAGGTGCA GTTCAGGAAT 18 60 
TGGCCGGAGA GAAAT 19 05 



j (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment 
(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
Leu Asp Gin Gly Pro Val lie Ala Asp His Pro Phe Ala Val Asp Val 

15 10 15 

Thr Ala Phe Arg 
20 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
Val Glu Phe Ser Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser 

15 10 15 

Ala Tyr Ala Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2- 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Val Gin Thr Val Glu Asn Val Val Asp Asn Glu Leu Val Arg Glu Glu 

1 5 10 15 

Gly Val Ser Trp 
20 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2- 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Ala Ala Leu Leu Ala Xaa Gly Ser Val Thr Ala Glu Glu Asp Arg Thr 

1 5 10 15 

Leu Gin Thr Ala 
20 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: N-terminal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Ser Tyr His Leu Asp Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATCGCSGAYC AYCCSTTYGC 2 0 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TCRTTRTCSA CSACRTTYTC 20 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double stranded 

(D) TOPOLOGY: Linear 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 

(ix) FEATURE: 

(A) NAME/KEY: P CDS (partial amino acid sequence) 

(B) LOCATION: 1 . . 788 

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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ATC GCC GAC CAC 
lie Ala Asp His 
1 

TTT GTC TTC CGC 
Phe Val Phe Arg 
20 

GAG GTG GCG CGG 
Glu Val Ala Arg 
35 

ACC GAG AAG AAC 
Thr Glu Lys Asn 
50 

GGC GTC GGG CCC 
Gly Val Gly Pro 
65 

TCC GAG TTC CAG 
Ser Glu Phe Gin 

ACC AAC TCC AGC 
Thr Asn Ser Ser 
100 

TTC AAC TTC GAG 
Phe Asn Phe Glu 
115 



CCC TTC GCC GTC GAC 
Pro Phe Ala Val Asp 
5 

AGT GCC AAG TTG GAT 
Ser Ala Lys Leu Asp 

25 

AAT GAG ACG GCC GTG 
Asn Glu Thr Ala Val 
40 

GCC CCC TGG TAT GTC 
Ala Pro Trp Tyr Val 
55 

GCG CAG TTC CTC TAC 
Ala Gin Phe Leu Tyr 
70 

TAC TGG GAG TAC CTC 
Tyr Trp Glu Tyr Leu 
85 

TGG GGC GAC GAG GGC 
Trp Gly Asp Glu Gly 

105 

ACG GGG AAT GTG CTC 
Thr Gly Asn Val Leu 
120 



GTC ACC GCC TTC 
Val Thr Ala Phe 
10 

GTG CTG CTG TCG 
Val Leu Leu Ser 

CAG CAG GCC GTC 
Gin Gin Ala Val 
45 

GCG GTC TCT GGC 

Ala Val Ser Gly 
60 

CGC CAG AAC GGC 
Arg Gin Asn Gly 
75 

GGG GAG TGG TGG 
Gly Glu Trp Trp 
90 

ACC TGG GCC GGG 
Thr Trp Ala Gly 

TTC CTC ACC GAG 
Phe Leu Thr Glu 
125 



CGC GAT CCG 48 
Arg Asp Pro 
15 

TTG GAT GAG 96 
Leu Asp Glu 
30 

GAT GGC TGG 144 
Asp Gly Trp 

GGG GTG CAC 192 
Gly Val His 

GGG AAC GCT 240 
Gly Asn Ala 
80 

CAG GAG GCG 288 
Gin Glu Ala 
95 

CGC TGG GGG 33 6 

Arg Trp Gly 

110 

GAG GGC CAT 384 
Glu Gly His 
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GAC CCC CAG ACG GGC GAG GTG TTC GTC ACC CTC GGC ACG GAG GGG TCT 432 

Asp Pro Gin Thr Gly Glu Val Phe Val Thr Leu Gly Thr Glu Gly Ser 

130 135 140 

GGC CTG CCA ATC GTG CCG CAG GTC TCC AGT ATC CAC GAT ATG CTG TGG 480 

Gly Leu Pro lie Val Pro Gin Val Ser Ser lie His Asp Met Leu Trp 

145 150 155 160 

GCG GCG GGT GAG GTC GGG GTG GGC AGT GAG CAG GAG GGT GCC AAG GTC 528 

Ala Ala Gly Glu Val Gly Val Gly Ser Glu Gin Glu Gly Ala Lys Val 

165 170 175 

GAG TTC TCC CCC TCC ATG GCC GGG TTT CTG GAC TGG GGG TTC AGC GCC 576 

Glu Phe Ser Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser Ala 

180 185 190 

TAC GCT GCG GCG GGC AAG GTG CTG CCG GCC AGC TCG GCG GTG TCG AAG 624 

Tyr Ala Ala Ala Gly Lys Val Leu Pro Ala Ser Ser Ala Val Ser Lys 

195 200 205 

ACC AGC GGC GTG GAG GTG GAT CGG TAT GTC TCG TTC GTC TGG TTG ACG 672 

Thr Ser Gly Val Glu Val Asp Arg Tyr Val Ser Phe Val Trp Leu Thr 

210 215 220 

GGC GAC CAG TAC GAG CAG GCG GAC GGG TTC CCC ACG GCC CAG CAG GGG 720 

Gly Asp Gin Tyr Glu Gin Ala Asp Gly Phe Pro Thr Ala Gin Gin Gly 

225 230 235 240 

TGG ACG GGG TCG CTG CTG CTG CCG CGC GAG CTG AAG GTG CAG ACG GTG 768 

Trp Thr Gly Ser Leu Leu Leu Pro Arg Glu Leu Lys Val Gin Thr Val 

245 250 255 
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GAG AAC GTC GTC GAC AAC GA 
Glu Asn Val Val Asp Asn 
260 



788 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Not relevant 

(ii) MOLECULE TYPE: protein 
(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Penicillium roqueforti IAM7254 

(ix) FEATURE: 

(A) NAME/KEY: mat peptide 

(B) LOCATION: 1 565 

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Asp Phe His Thr Pro lie Asp Tyr Asn Ser Ala Pro Pro Asn Leu 

15 10 15 

Ser Thr Leu Ala Asn Ala Ser Leu Phe Lys Thr Trp Arg Pro Arg Ala 

20 25 30 

His Leu Leu Pro Pro Ser Gly Asn He Gly Asp Pro Cys Gly His Tyr 
35 40 45 
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Thr Asp Pro Lys 
50 

lie Ser Gly Ala 
65 

Pro Asp Gly Ala 

Ser Val Phe Asp 
100 

Thr Leu Leu Tyr 
115 

Pro Tyr Thr Arg 
130 

Gly Gly His Asn 
145 

Pro Pro Phe Ala 

Gin Ser Pro lie 
180 

Tyr Val Ala He 
195 

Leu Tyr Arg Gin 
210 

Gin Trp Trp Lys 
225 



Thr Gly Leu Phe 
55 

Thr Thr Asp Asp 
70 

Pro Ser He Val 
85 

Gly Ser Val He 

Thr Ser Val Ser 
120 

Gly Ser Glu Thr 
135 

Phe Thr Lys Leu 
150 

Leu Asn Val Thr 
165 

Leu Asp Lys Ser 

Ser Gly Gly Val 
200 

Asn Asp Ala Asp 
215 

Glu Pro Leu Asn 
230 



His Val Gly Trp 
60 

Leu Val Thr Tyr 
75 

Ala Gly Gly Lys 
90 

Pro Ser Gly He 
105 

Tyr Leu Pro He 

Gin Ser Leu Ala 
140 

Asn Gin Gly Pro 
155 

Ala Phe Arg Asp 
170 

Val Asn Ser Thr 
185 

His Gly Val Gly 

Phe Gin Tyr Trp 
220 

Thr Thr Trp Gly 
235 
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Leu Tyr Ser Gly 

Lys Asp Leu Asn 
80 

Asn Asp Pro Leu 
95 

Asp Gly Met Pro 
110 

His Trp Ser He 
125 

Val Ser Tyr Asp 

Val He Pro Thr 
160 

Pro Tyr Val Phe 
175 

Gin Gly Thr Trp 
190 

Pro Cys Gin Phe 
205 

Glu Tyr Leu Gly 

Lys Gly Asp Trp 
240 



Ala Gly Gly Trp Gly Phe Asn Phe Glu Val Gly Asn Val Phe Ser Leu 

245 . 250 255 

Asn Ala Glu Gly Tyr Ser Glu Asp Gly Glu He Phe He Thr Leu Gly 

260 265 270 

Ala Glu Gly Ser Gly Leu Pro He Val Pro Gin Val Ser Ser He Arg 

275 280 285 

Asp Met Leu Trp Val Thr Gly Asn Val Thr Asn Asp Gly Ser Val Thr 

290 295 300 

Phe Lys Pro Thr Met Ala Gly Val Leu Asp Trp Gly Val Ser Ala Tyr 
305 310 315 320 

Ala Ala Ala Gly Lys He Leu Pro Ala Ser Ser Gin Ala Ser Thr Lys 

325 330 335 

Ser Gly Ala Pro Asp Arg Phe He Ser Tyr Val Trp Leu Thr Gly Asp 

340 345 350 

Leu Phe Glu Gin Val Lys Gly Phe Pro Thr Ala Gin Gin Asn Trp Thr 

355 360 365 

Gly Ala Leu Leu Leu Pro Arg Glu Leu Asn Val Arg Thr He Ser Asn 

370 375 380 

Val Val Asp Asn Glu Leu Ser Arg Glu Ser Leu Thr Ser Trp Arg Val 
385 390 395 400 

Ala Arg Glu Asp Ser Gly Gin He Asp Leu Glu Thr Met Gly He Ser 

405 410 415 

He Ser Arg Glu Thr Tyr Ser Ala Leu Thr Ser Gly Ser Ser Phe Val 
420 425 430 
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Glu Ser Gly Lys Thr Leu Ser Asn Ala Gly Ala Val Pro Phe Asn Thr 

435 440 445 

Ser Pro Ser Ser Lys Phe Phe Val Leu Thr Ala Asn lie Ser Phe Pro 

450 455 460 

Thr Ser Ala Arg Asp Ser Gly lie Gin Ala Gly Phe Gin Val Leu Ser 
465 470 475 480 

Ser Ser Leu Glu Ser Thr Thr lie Tyr Tyr Gin Phe Ser Asn Glu Ser 

485 490 495 

lie lie Val Asp Arg Ser Asn Thr Ser Ala Ala Ala Arg Thr Thr Ala 

500 505 510 

Gly lie Leu Ser Asp Asn Glu Ala Gly Arg Leu Arg Leu Phe Asp Val 

515 520 525 

Leu Arg Asn Gly Lys Glu Gin Val Glu Thr Leu Glu Leu Thr lie Val 
530 535 540 

Val Asp Asn Ser Val Leu Glu Val Tyr Ala Asn Gly Arg Phe Ala Leu 
545 550 555 560 

Gly Thr Trp Ala Arg 

565 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1695 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double stranded 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Penicillium roqueforti IAM7254 
(ix) FEATURE: 

(A) NAME /KEY : mat peptide 

(B) LOCATION: 1 .. 1695 

(C) IDENTIFICATION METHOD: E 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GTTGATTTCC 


ATACCCCGAT 


TGACTATAAC 


TCGGCTCCGC 


CAAACCTTTC 


TACCCTGGCA 


60 


AACGCATCTC 


TTTTCAAGAC 


ATGGAGACCC 


AGAGCCCATC 


TTCTCCCTCC 


ATCTGGGAAC 


120 


ATAGGCGACC 


CGTGCGGGCA 


CTATACCGAT 


CCCAAGACTG 


GTCTCTTCCA 


CGTGGGTTGG 


180 


CTTTACAGTG 


GGATTTCGGG 


AGCGACAACC 


GACGATCTCG 


TTACCTATAA 


AGACCTCAAT 


240 


CCCGATGGAG 


CCCCGTCAAT 


TGTTGCAGGA 


GGAAAGAACG 


ACCCTCTTTC 


TGTCTTCGAT 


300 


GGCTCGGTCA 


TTCCAAGCGG 


TATAGACGGC 


ATGCCAACTC 


TTCTGTATAC 


CTCTGTATCA 


360 


TACCTCCCAA 


TCCACTGGTC 


CATCCCCTAC 


ACCCGGGGAA 


GCGAGACACA 


ATCCTTGGCC 


420 


GTTTCCTATG 


ACGGTGGTCA 


CAACTTCACC 


AAGCTCAACC 


AAGGGCCCGT 


GATCCCTACG 


480 


CCTCCGTTTG 


CTCTCAATGT 


CACCGCTTTC 


CGTGACCCCT 


ACGTTTTCCA 


AAGCCCAATT 


540 


CTGGACAAAT 


CTGTCAATAG 


TACCCAAGGA 


ACATGGTATG 


TCGCCATATC 


TGGCGGTGTC 


600 


CACGGTGTCG 


GACCTTGTCA 


GTTCCTCTAC 


CGTCAGAACG 


ACGCAGATTT 


TCAATATTGG 


660 


GAATATCTCG 


GGCAATGGTG 


GAAGGAGCCC 


CTTAATACCA 


CTTGGGGAAA 


GGGTGACTGG 


720 


GCCGGGGGTT 


GGGGCTTCAA 


CTTTGAGGTT 


GGCAACGTCT 


TTAGTCTGAA 


TGCAGAGGGG 


780 


TATAGTGAAG 


ACGGCGAGAT 


ATTCATAACC 


CTCGGTGCTG 


AGGGTTCGGG 


ACTTCCCATC 


840 


GTTCCTCAAG 


TCTCCTCTAT 


TCGCGATATG 


CTGTGGGTGA 


CCGGCAATGT 


CACAAATGAC 


900 


GGCTCTGTCA 


CTTTCAAGCC 


AACCATGGCG 


GGTGTGCTTG 


ACTGGGGCGT 


GTCGGCATAT 


960 


GCTGCTGCAG 


GCAAGATCTT 


GCCGGCCAGC 


TCTCAGGCAT 


CCACAAAGAG 


CGGTGCCCCC 


1020 
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GATCGGTTCA TTTCCTATGT CTGGCTCACT GGAGATCTAT TCGAGCAAGT GAAAGGATTC 1080 

CCTACCGCTC AACAAAACTG GACCGGGGCC CTCTTACTGC CGCGAGAGCT GAATGTCCGC 1140 

ACTATCTCTA ACGTGGTGGA TAACGAACTT TCGCGTGAGT CCTTGACATC GTGGCGCGTG 1200 

GCCCGCGAAG ACTCTGGTCA GATCGACCTT GAAACAATGG GAATCTCAAT TTCCAGGGAG 1260 

ACTTACAGCG CTCTCACATC CGGCTCATCT TTTGTCGAGT CTGGTAAAAC GTTGTCGAAT 1320 

GCTGGAGCAG TGCCCTTCAA TACCTCACCC TCAAGCAAGT TCTTCGTGCT GACAGCAAAT 1380 

ATATCTTTCC CGACCTCTGC CCGTGACTCT GGCATCCAGG CTGGTTTCCA GGTTTTATCC 1440 

TCTAGTCTTG AGTCTACAAC TATCTACTAC CAATTCTCCA ACG AGTC CAT CATCGTCGAC 1500 

CGCAGCAACA CGAGTG CTGC GGCGAGAACA ACTGCTGGGA TCCTCAGTGA TAACGAGGCG 1560 

GGACGTCTGC GCCTCTTCGA CGTGTTGCGA AATGGAAAAG AACAGGTTGA AACTTTGGAG 1620 

CTCACTATCG TGGTGGATAA TAGTGTACTG GAAGTATATG CCAATGGACG CTTTGCTCTA 1680 

GGCACTTGGG CTCGG 1695 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Not relevant 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Scopulariopsis brevicaulis IF04843 

(ix) FEATURE: 

(A) NAME/KEY: mat peptide 

(B) LOCATION: 1 . . 574 
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(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Gin Pro Thr Ser Leu Ser He Asp Asn Ser Thr Tyr Pro Ser He Asp 
1 5 10 15 

Tyr Asn Ser Ala Pro Pro Asn Leu Ser Thr Leu Ala Asn Asn Ser Leu 

20 25 30 

Phe Glu Thr Trp Arg Pro Arg Ala His Val Leu Pro Pro Gin Asn Gin 
35 40 45 

O He Gly Asp Pro Cys Met His Tyr Thr Asp Pro Glu Thr Gly He Phe 
S 50 55 60 

B His Val Gly Trp Leu Tyr Asn Gly Asn Gly Ala Ser Gly Ala Thr Thr 
it 65 70 75 80 

Glu Asp Leu Val Thr Tyr Gin Asp Leu Asn Pro Asp Gly Ala Gin Met 

85 90 95 

'S He Leu Pro Gly Gly Val Asn Asp Pro He Ala Val Phe Asp Gly Ala 
13 100 105 no 

Val He Pro Ser Gly He Asp Gly Lys Pro Thr Met Met Tyr Thr Ser 

115 120 125 

Val Ser Tyr Met Pro He Ser Trp Ser He Ala Tyr Thr Arg Gly Ser 

130 135 140 

Glu Thr His Ser Leu Ala Val Ser Ser Asp Gly Gly Lys Asn Phe Thr 
145 150 155 160 

Lys Leu Val Gin Gly Pro Val He Pro Ser Pro Pro Phe Gly Ala Asn 

165 170 175 
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Val Thr Ser Trp Arg Asp Pro Phe Leu Phe Gin Asn Pro Gin Phe Asp 

180 185 190 

Ser Leu Leu Glu Ser Glu Asn Gly Thr Trp Tyr Thr Val lie Ser Gly 

195 200 205 

Gly lie His Gly Asp Gly Pro Ser Ala Phe Leu Tyr Arg Gin His Asp 

210 215 220 

Pro Asp Phe Gin Tyr Trp Glu Tyr Leu Gly Pro Trp Trp Asn Glu Glu 
225 230 235 240 

Gly Asn Ser Thr Trp Gly Ser Gly Asp Trp Ala Gly Arg Trp Gly Tyr 

245 250 255 

Asn Phe Glu Val lie Asn lie Val Gly Leu Asp Asp Asp Gly Tyr Asn 

260 265 270 

Pro Asp Gly Glu He Phe Ala Thr Val Gly Thr Glu Trp Ser Phe Asp 

275 280 285 

Pro He Lys Pro Gin Ala Ser Asp Asn Arg Glu Met Leu Trp Ala Ala 

290 295 300 

Gly Asn Met Thr Leu Glu Asp Gly Asp He Lys Phe Thr Pro Ser Met 
305 310 315 320 

Ala Gly Tyr Leu Asp Trp Gly Leu Ser Ala Tyr Ala Ala Ala Gly Lys 

325 330 335 

Glu Leu Pro Ala Ser Ser Lys Pro Ser Gin Lys Ser Gly Ala Pro Asp 

340 345 350 

Arg Phe Val Ser Tyr Leu Trp Leu Thr Gly Asp Tyr Phe Glu Gly His 
355 360 365 
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Asp Phe Pro Thr 
370 

Arg Glu Leu Ser 
385 

Ala Arg Glu Thr 

Leu Glu Leu Val 
420 

Glu Met Thr Ser 
435 

Ser Pro Gly Ser 
450 

Val Leu Thr Ala 
465 

Leu Lys Ala Gly 

Val Tyr Tyr Gin 
500 

Ser Ser Ala Ala 
515 

Phe Gly Lys Met 
530 

lie Glu Thr Leu 
545 



Pro Gin Gin Asn 
375 

Val Gly Thr lie 
390 

Gly Ser Trp Arg 
405 

Thr Leu Lys Gin 

Gly Asn Ser Phe 
440 

Thr Ala Phe Gin 
455 

Ser Leu Ser Phe 
470 

Phe Glu lie Leu 
485 

Phe Ser Asn Glu 

Ala Leu Thr Thr 
520 

Arg Leu Phe Asp 
535 

Asp Leu Thr lie 
550 



Trp Thr Gly Ser 
380 

Pro Asn Val Val 
395 

Val Gly Thr Asn 
410 

Glu lie Ala Arg 
425 

Thr Glu Ala Ser 

Gin Ser Leu Asp 
460 

Pro Ser Ser Ala 
475 

Ser Ser Glu Phe 
490 

Ser lie lie lie 
505 

Asp Gly lie Asp 

Val Val Glu Gly 
540 

Val Val Asp Asn 
555 



Leu Leu Leu Pro 

Asp Asn Glu Leu 
400 

Asp Thr Gly Val 
415 

Glu Thr Leu Ala 
430 

Arg Asn Val Ser 
445 

Ser Lys Phe Phe 

Arg Asp Ser Asp 
480 

Glu Ser Thr Thr 
495 

Asp Arg Ser Asn 
510 

Thr Arg Asn Glu 
525 

Asp Gin Glu Arg 

Ser He Val Glu 
560 
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Val His Ala Asn Gly Arg Phe Ala Leu Ser Thr Trp Val Arg 

565 570 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double stranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Scopulariopsis brevicaulis IF04843 
(ix) FEATURE: 

(A) NAME/KEY: mat peptide 

(B) LOCATION: 1 .. 1722 

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAACCTACGT CTCTGTCAAT CGACAATTCC ACGTATCCTT CTATCGACTA CAACTCCGCC 60 

CCTCCAAACC TCTCGACTCT TGCCAACAAC AGCCTCTTCG AGACATGGAG GCCGAGGGCA 120 

CACGTCCTTC CGCCCCAGAA CCAGATCGGC GATCCGTGTA TGCACTACAC CGACCCCGAG 180 

ACAGGAATCT TCCACGTCGG CTGGCTGTAC AACGGCAATG GCGCTTCCGG CGCCACGACC 240 

GAGGATCTCG TCACCTATCA GGATCTCAAC CCCGACGGAG CGCAGATGAT CCTTCCGGGT 3 00 

GGTGTGAATG ACCCCATTGC TGTCTTTGAC GGCGCGGTTA TTCCCAGTGG CATTGATGGG 360 

AAACCCACCA TGATGTATAC CTCGGTGTCA TACATGCCCA TCTCCTGGAG CATCGCTTAC 420 

ACCAGGGGAA GCGAGACCCA CTCTCTCGCA GTGTCGTCCG ACGGCGGTAA GAACTTCACC 480 
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AAGCTGGTGC AGGGCCCCGT CATTCCTTCG CCTCCCTTCG GCGCCAACGT GACCAGCTGG 540 

CGTGACCCCT TCCTGTTCCA AAACCCCCAG TTCGACTCTC TCCTCGAAAG CGAGAACGGC 600 

ACGTGGTACA CCGTTATCTC TGGTGGCATC CACGGTGACG GCCCCTCCGC GTTCCTCTAC 660 

CGTCAGCACG ACCCCGACTT CCAGTACTGG GAGTACCTTG GACCGTGGTG GAACGAGGAA 720 

GGGAACTCGA CCTGGGGCAG CGGTGACTGG GCTGGCCGGT GGGGCTACAA CTTCGAGGTC 780 

ATCAACATTG TCGGTCTTGA CGATGATGGC TACAACCCCG ACGGTGAAAT CTTTGCCACG 840 

GTAGGTACCG AATGGTCGTT TGACCCCATC AAACCGCAGG CCTCGGACAA CAGGGAGATG 900 

CTCTGGGCCG CGGGCAACAT GACTCTCGAG GACGGCGATA TCAAGTTCAC GCCAAGCATG 960 

GCGGGCTACC TCGACTGGGG TCTATCGGCG TATGCCGCCG CTGGCAAGGA GCTGCCCGCT 1020 

TCTTCAAAGC CTTCGCAGAA GAGCGGTGCG CCGGACCGGT TCGTGTCGTA CCTGTGGCTC 1080 

ACCGGTGACT ACTTCGAGGG CCACGACTTC CCCACCCCGC AGCAGAATTG GACCGGCTCG 1140 

CTTTTGCTTC CGCGTGAGCT GAGCGTCGGG ACGATTCCCA ACGTTGTCGA CAACGAGCTT 1200 

GCTCGCGAGA CGGGCTCTTG GAGGGTTGGC ACCAACGACA CTGGCGTGCT TGAGCTGGTC 1260 

ACTCTGAAGC AGGAGATTGC TCGCGAGACG CTGGCTGAAA TGACCAGCGG CAACTCCTTC 132 0 

ACCGAGGCGA GCAGGAATGT CAGCTCGCCC GGATCTACCG CCTTCCAGCA GTCCCTGGAT 1380 

TCCAAGTTCT TCGTCCTGAC CGCCTCGCTC TCCTTCCCTT CGTCGGCTCG CGACTCCGAC 1440 

CTCAAGGCTG GTTTCGAGAT CCTGTCGTCC GAGTTTGAGT CGACCACGGT CTACTACCAG 1500 

TTTTCCAACG AGTCCATCAT CATTGACCGG AGCAACTCGA GTGCTGCCGC CTTGACTACC 1560 

GATGGAATCG ACACCCGCAA CGAGTTTGGC AAGATGCGCC TGTTTGATGT TGTCGAGGGT 1620 

G A C C AG GAG C GTATCGAGAC GCTCGATCTC ACTATTGTGG TTGATAACTC GATCGTTGAG 1680 

GTTCATGCCA ACGGGCGATT CGCTCTGAGC ACTTGGGTTC GG 1722 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
G CGAATTCC A ATGAAGCTCA CCACTACC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GCGGATCCCG GTCAATTTCT CTCC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GACTGACCGG TGTTCATCC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CTCGGTTGTC ATAGATGTGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAATCCAGGA GGATCCCAAT GAAG 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TGACCGGGAT CCGGGCATGC AG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CGCGTCGTCT AGAGGTTGTC ACTT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CCCTATTGGG GTCCATGGCC C 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CAACTGCTGG CATCCTCAGT GA 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GCGGATCCAT GAAGCTATCA AATGCAATCA 



(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 6 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GCGGATCCTT ACCGAGCCCA AGTGCC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

GCGGATCCAA TGAAGCTCAC CACTACC 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GCGGATCCCG GTCAATTTCT CTCC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

GTCACCGCCT GGCGCGATCC G 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGCACGGAGT GGTCTGGCC 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CTCCAGTATC AAGGATATGC TGTG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

CGACCAGTAC AAGCAGGCGG 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
TCCAGTATCC GCGATATGCT G 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGGCACGGAG GTTTCTGGCC TGC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

CGGCACGGAG GAGTCTGGCC TGC 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
CGGCACGGAG GATTCTGGCC TGC 
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